On a Resampling Approach to Choosing the Number of Components in Normal Mixture Models

نویسندگان

  • G. J. McLachlan
  • D. Peel
چکیده

We consider the tting of a g-component normal mixture to multivariate data. The problem is to test whether g is equal to some speciied value versus some speciied alternative value. This problem would arise, for example, in the context of a cluster analysis eeected by a normal mixture model, where the decision on the number of clusters is undertaken by testing for the smallest value of g compatible with the data. A test statistic can be formed in terms of the likelihood ratio. Unfortunately, regularity conditions do not hold for the likelihood ratio statistic to have its usual asymptotic null distribution of chi-squared. One approach to the assessment of P-values with the use of this statistic is to adopt a resampling approach. An investigation is undertaken of the accuracy of P-values assessed in this manner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Determination of the number of components in finite mixture distribution with Skew-t-Normal components

Abstract One of the main goal in the mixture distributions is to determine the number of components. There are different methods for determination the number of components, for example, Greedy-EM algorithm which is based on adding a new component to the model until satisfied the best number of components. The second method is based on maximum entropy and finally the third method is based on non...

متن کامل

The Family of Scale-Mixture of Skew-Normal Distributions and Its Application in Bayesian Nonlinear Regression Models

In previous studies on fitting non-linear regression models with the symmetric structure the normality is usually assumed in the analysis of data. This choice may be inappropriate when the distribution of residual terms is asymmetric. Recently, the family of scale-mixture of skew-normal distributions is the main concern of many researchers. This family includes several skewed and heavy-tailed d...

متن کامل

Model Selection for Mixture Models Using Perfect Sample

We have considered a perfect sample method for model selection of finite mixture models with either known (fixed) or unknown number of components which can be applied in the most general setting with assumptions on the relation between the rival models and the true distribution. It is, both, one or neither to be well-specified or mis-specified, they may be nested or non-nested. We consider mixt...

متن کامل

MIXFIT: an algorithm for the automatic fitting and testing of normal mixture models

We consider the fitting of normal mixture models to multivariate data, using maximum likelihood via the EM algorithm. This approach requires the specification of an initial estimate of the vector of unknown parameters, or equivalently, of an initial classification of the data with respct to the components of the mixture model under fit. We describe an algorithm called MIXFIT that automatically ...

متن کامل

THE EMMIX ALGORITHM FOR THE FITTING OF MIXTURES OF NORMAL AND t-COMPONENTS

We consider the tting of normal mixture models to multivariate data, using maximum likelihood via the EM algorithm. This approach requires the initial speciication of an initial estimate of the vector of unknown parameters, or equivalently, of an initial clas-siication of the data with respect to the components of the mixture model under t. We describe an algorithm called EMMIX that automatical...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997